091-2230-8145     |      dataprojectng@gmail.com

Generic Metadata Handling in Scientific Data Life Cycles

  • Project Research
  • 1-5 Chapters
  • Abstract : Available
  • Table of Content: Available
  • Reference Style: APA
  • Recommended for : Student Researchers
  • NGN 3000

Introduction

This chapter introduces the dissertation by describing its context, the identified challenges, how the chosen challenge was met, the achieved impact, relevant publications, and how the thesis is structured. 1.1 Context In science data is the essential focal point in todays computational and quantitative approaches to scientific knowledge gain. Computational simulations enable far reaching explorations of modeled realities while quantitative methods gather data to improve the understanding of observed phenomena. These methods are increasingly viable only via high-end storage and large-scale High Performance Computing resources with individual requirements dramatically rising. Data throughputs involve gigabytes per second continuously, volumes are of petabyte magnitude, continuous files per second rates are in the double-digit range, and a vast universe of complex data representations exists. The great potential of such data is evident by the current trend of Big Data in science that aims at large-scale information extraction to foster scientific discoveries. This is fundamentally enabled by intelligently handling data and by combining a large variety of information technology methods to so-called data life cycles. In principle, these consist of data sources, systems to manage data as well as compute resources, methods for access rights management, utilization interfaces and data sinks. Scientists are naturally focused on their particular research. Thus, metadata is an essential step forward in the efficiency of use as it enables managing data based on its content instead of location. Via specific data life cycles scientists are freed from the necessity to extensively deal with IT infrastructures while still utilizing them to drive their research by handling their extensive data and computing demands. In this complex technological environment, a plethora of significant challenges presents itself that hinders the advancement of the state-of-the-art in data-driven knowledge gain. 1.2 Challenges Vital challenges in managing data life cycles are manifold. Federated authentication and authorization infrastructures need to be integrated while being mindful of the overall resilience of increasingly complex data life cycles. The increasing numbers of files and data amounts need to be managed by Big Data systems. These in turn need to be efficiently integrated with High Performance Computing resources for analysis which signifies the need for advanced interoperability. Besides automated pre- and postprocessing, the user-friendly creation, and execution of workflows to encapsulate complex analysis procedures need to be supported. Integrated scientific environments need to be provided that hide the underlying complexity while enabling that use. Essential is also the building of trust that an infrastructure delivers 6 1. INTRODUCTION what it promises. Closely connected is moving from a fixed-term build up phase to a sustainable operation phase. As these goals are partly opposing to each other, a effective balance between them needs to be developed for each data life cycle. The dissertation focuses on the major challenge of the organization of large numbers of files in the million range using information about data, so-called metadata. Currently, solutions are often either use case specific or lacking completely, thus, preventing easy access and re-use. Without metadata, users have to remember where an individual file is located. With a large number of files this is inefficient if not impossible. This especially holds true for Big Data use cases with a large number of files with complex content and stored in distributed locations. Currently, significant efforts need to be made to implement even narrowly applicable and pragmatic metadata handling solutions for every new scientific experiment.




FIND OTHER RELATED TOPICS


Related Project Materials

COMMUNICATION STRATEGY FOR NON GOVERNMENTAL ORGANIZATIONS ON CHILD ADOPTION; A CASE STUDY OF ORPHANAGE HOMES IN LAGOS STATE

BACKGROUND OF THE STUDY

The emergence of non-governmental organizations (NGOs) in recent times has mot...

Read more
LEXICO-SEMANTIC NIGERIANISM IN NIGERIAN NEWSPAPERS

Abstract

The English language in Nigeria is older than the Nigerian nation. It was formally introduced in 1842 by the f...

Read more
DESIGN AND IMPLEMENTATION OF LOCAL GOVERNMENT PERSONNEL INFORMATION SYSTEM

Statement of Problem

Personnel management involves a lot of paper work and the consequence of this is that it is difficu...

Read more
GENDER ROLES AND TECHNOLOGY ADOPTION IN RUBBER PRODUCTION IN EDO STATE

ABSTRACT

This study examined the adoption of improved rubber production technologies by farmers in Edo...

Read more
AN EVALUATION OF THE IMPACT OF SUPERVISION AND CONTROL OF THE CENTRAL BANK ON THE PERFORMANCE OF COMMERCIAL BANKS

ABSTRACT

This research project tends to evaluate the impact of supervision and control of the Central Bank on the performance of commerci...

Read more
A COMPARATIVE STUDY OF EXPENDITURE CONTROL METHODS IN GOVERNMENT AND PRIVATELY OWNED HOSPITALS (A STUDY OF UNIVERSITY OF NIGERIA TEACHING HOSPITAL, ENUGU AND TORONTO HOSPITAL ONITSHA)

ABSTRACT

This research work on A Comparative Study of Expenditure Controls method in Government and private Hospitals is aimed at studyin...

Read more
THE IMPORTANCE OF COMMUNITY OF INQUIRY PHILOSOPHY FOR CHILDREN SYSTEM OF EDUCATION IN OUR SCHOOLS

ABSTRACT

The Important of Community of Inquiry Philosophy for children system of education (COPI4C) in...

Read more
ENTREPRENEURIAL EDUCATION AS A TOOL OF REDUCING UNEMPLOYMENT IN NIGERIA

Abstract

Education in Nigeria is devoid of the element crucial to averting the surging rate of unemployment in the count...

Read more
PLANNING AND IMPLEMENTATION OF QUALITY CONTROL IN A MANUFACTURING COMPANY

ABSTRACT

The broad objective of this study is to examine the impact of production planning and control...

Read more
PERCEIVED DIFFICULTIES OF SOME BIOLOGICAL CONCEPT BY SENIOR SECONDARY SCHOOL STUDENTS IN THE STUDY OF SOME BIOLOGICAL CONCEPT

Abstract

Biology is basic for understanding the complexities of life. This aspect of science for example, genetically ma...

Read more
Share this page with your friends




whatsapp